Overview

Dataset info

Number of variables9
Number of observations423909
Missing cells52288 (1.4%)
Duplicate rows72 (< 0.1%)
Total size in memory188.7 MiB
Average record size in memory466.8 B

Variables types

CAT5
NUM3
BOOL1

Reproduction info

Date of analysis2020-02-22 16:00:10.735534
Versionpandas-profiling v2.4.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download Configurationconfig.yaml

Warnings

Dataset has 72 (< 0.1%) duplicate rows Warning
addr has a high cardinality: 36051 distinct values Warning
desc has a high cardinality: 423837 distinct values Warning
e has constant value "1" Rejected
lat is highly skewed (γ1 = -100.8720036) Skewed
lng is highly skewed (γ1 = 198.1015054) Skewed
timeStamp only contains datetime values, but is categorical. Consider applying pd.to_datetime()Type
timeStamp has a high cardinality: 409544 distinct values Warning
title has a high cardinality: 141 distinct values Warning
twp has a high cardinality: 69 distinct values Warning
zip has 52129 (12.3%) missing values Missing

Variables

addr
Categorical

HIGH CARDINALITY
Distinct count36051
Unique (%)8.5%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
SHANNONDELL DR & SHANNONDELL BLVD
 
4328
MAIN ST & OLD SUMNEYTOWN PIKE
 
1595
THE FAIRWAY & RYDAL RD
 
1288
EVERGREEN RD & W LIGHTCAP RD
 
1020
EAGLEVILLE RD & SUNDERLAND DR
 
1006
Other values (36046)
414672
ValueCountFrequency (%) 
SHANNONDELL DR & SHANNONDELL BLVD 4328 1.0%
 
MAIN ST & OLD SUMNEYTOWN PIKE 1595 0.4%
 
THE FAIRWAY & RYDAL RD 1288 0.3%
 
EVERGREEN RD & W LIGHTCAP RD 1020 0.2%
 
EAGLEVILLE RD & SUNDERLAND DR 1006 0.2%
 
SCHUYLKILL EXPY & WEADLEY RD OVERPASS 953 0.2%
 
GULPH RD & KIRK AVE 946 0.2%
 
BLACK ROCK RD & S TRAPPE RD 932 0.2%
 
DAVISVILLE RD & PENNYPACK RD 883 0.2%
 
SCHUYLKILL EXPY & CONSHOHOCKEN STATE UNDERPASS 870 0.2%
 
Other values (36041) 410088 96.7%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length69
Mean length25.40654952
Min length1
Scatter

desc
Categorical

HIGH CARDINALITY
Distinct count423837
Unique (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
WALDEN POND WAY & WOODVIEW LN; TOWAMENCIN; 2017-09-27 @ 17:54:55-Station:STA76;
 
4
GERMANTOWN PIKE; WORCESTER; 2017-09-27 @ 17:50:24-Station:STA83;
 
4
GREEN ST & E BASIN ST; NORRISTOWN; Station 308A; 2016-05-10 @ 20:23:26;
 
4
N GULPH RD; UPPER MERION; 2015-12-12 @ 21:09:39-Station:STA47;
 
3
HUNTINGDON RD & SUSQUEHANNA RD; ABINGTON; Station 381; 2017-09-27 @ 17:58:40;
 
3
Other values (423832)
423891
ValueCountFrequency (%) 
WALDEN POND WAY & WOODVIEW LN; TOWAMENCIN; 2017-09-27 @ 17:54:55-Station:STA76; 4 < 0.1%
 
GERMANTOWN PIKE; WORCESTER; 2017-09-27 @ 17:50:24-Station:STA83; 4 < 0.1%
 
GREEN ST & E BASIN ST; NORRISTOWN; Station 308A; 2016-05-10 @ 20:23:26; 4 < 0.1%
 
N GULPH RD; UPPER MERION; 2015-12-12 @ 21:09:39-Station:STA47; 3 < 0.1%
 
HUNTINGDON RD & SUSQUEHANNA RD; ABINGTON; Station 381; 2017-09-27 @ 17:58:40; 3 < 0.1%
 
ASTOR ST & W OAK ST; NORRISTOWN; Station 308A; 2017-09-27 @ 17:44:37; 3 < 0.1%
 
SUSQUEHANNA RD & E BUTLER PIKE; UPPER DUBLIN; 2016-05-13 @ 12:15:59; 3 < 0.1%
 
DOCK DR & DETWILER RD; TOWAMENCIN; Station 345B; 2015-12-13 @ 10:24:26; 2 < 0.1%
 
MT CARMEL AVE & ROBERTS AVE; CHELTENHAM; Station 358A; 2016-06-06 @ 15:48:23; 2 < 0.1%
 
SWINGING BRIDGE RD & BROOMSTICK RD; UPPER HANOVER; Station 369; 2016-06-01 @ 16:34:27; 2 < 0.1%
 
Other values (423827) 423879 > 99.9%
 

Composition

Contains charsTrue
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length120
Mean length71.62109557
Min length32
Scatter

e
Boolean

CONST
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
1
423909
ValueCountFrequency (%) 
1 423909 100.0%
 

lat
Real number (ℝ≥0)

SKEWED
Distinct count22661
Unique (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40.15861796
Minimum0
Maximum51.3353899
Zeros1
Zeros (%)< 0.1%
Memory size3.2 MiB
Mini histogram

Quantile statistics

Minimum0
5-th percentile40.015667
Q140.0997844
median40.1439004
Q340.2290075
95-th percentile40.3083454
Maximum51.3353899
Range51.3353899
Interquartile range (IQR)0.1292231

Descriptive statistics

Standard deviation0.1291702851
Coefficient of variation (CV)0.003216502253
Kurtosis26455.59919
Mean40.15861796
Median Absolute Deviation (MAD)0.07037184082
Skewness-100.8720036
Sum17023599.58
Variance0.01668496256
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[ 0. 39.08012035 39.85565335 39.9512759 39.97672825 ... 40.4507349 40.4576575 40.493129 41.18523905 51.3353899 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
40.0972222 6243 1.5%
 
40.1330371 4328 1.0%
 
40.0249667 3851 0.9%
 
40.2290075 3835 0.9%
 
40.1723141 2085 0.5%
 
40.1082672 2014 0.5%
 
40.0698321 1993 0.5%
 
40.1532684 1913 0.5%
 
40.2890267 1646 0.4%
 
40.0812601 1544 0.4%
 
Other values (22651) 394457 93.1%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
22.9867569 5 < 0.1%
 
26.820553 1 < 0.1%
 
30.333596 1 < 0.1%
 
32.3792233 1 < 0.1%
 
ValueCountFrequency (%) 
51.3353899 2 < 0.1%
 
46.5653163 1 < 0.1%
 
41.2033216 3 < 0.1%
 
41.1671565 2 < 0.1%
 
40.9694305 6 < 0.1%
 

lng
Real number (ℝ)

SKEWED
Distinct count22680
Unique (%)5.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-75.31402226
Minimum-119.6982057
Maximum87.8549755
Zeros1
Zeros (%)< 0.1%
Memory size3.2 MiB
Mini histogram

Quantile statistics

Minimum-119.6982057
5-th percentile-75.6313032
Q1-75.3915474
median-75.3045627
Q3-75.2107603
95-th percentile-75.1022513
Maximum87.8549755
Range207.5531812
Interquartile range (IQR)0.1807871

Descriptive statistics

Standard deviation0.6560476629
Coefficient of variation (CV)-0.008710830244
Kurtosis48196.49659
Mean-75.31402226
Median Absolute Deviation (MAD)0.1218555596
Skewness198.1015054
Sum-31926291.86
Variance0.430398536
Histogram
Histogram with fixed size bins (bins=10)
Histogram
Histogram with variable size bins (bins=[-119.6982057 -80.2589274 -75.9911789 -75.73093885 -75.71755175 ... -75.0181556 -74.99405845 -74.81187245 -74.3018827 87.8549755 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-75.3761952 6243 1.5%
 
-75.4084631 4328 1.0%
 
-75.2829046 4239 1.0%
 
-75.3878525 3836 0.9%
 
-75.2362381 2833 0.7%
 
-75.4927278 2088 0.5%
 
-75.3062326 2014 0.5%
 
-75.3162951 1993 0.5%
 
-75.1895576 1913 0.5%
 
-75.3995896 1646 0.4%
 
Other values (22670) 392776 92.7%
 
ValueCountFrequency (%) 
-119.6982057 5 < 0.1%
 
-95.712891 1 < 0.1%
 
-95.5955947 1 < 0.1%
 
-86.3077368 1 < 0.1%
 
-86.276106 1 < 0.1%
 
ValueCountFrequency (%) 
87.8549755 5 < 0.1%
 
30.802498 1 < 0.1%
 
0 1 < 0.1%
 
-0.742856 2 < 0.1%
 
-66.4619164 1 < 0.1%
 

timeStamp
Categorical

TYPE DATE
HIGH CARDINALITY
Distinct count409544
Unique (%)96.6%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
2018-10-06 19:26:38
 
9
2018-07-09 13:23:16
 
8
2016-07-20 06:42:37
 
6
2016-07-10 19:00:45
 
6
2017-02-09 07:17:58
 
6
Other values (409539)
423874
ValueCountFrequency (%) 
2018-10-06 19:26:38 9 < 0.1%
 
2018-07-09 13:23:16 8 < 0.1%
 
2016-07-20 06:42:37 6 < 0.1%
 
2016-07-10 19:00:45 6 < 0.1%
 
2017-02-09 07:17:58 6 < 0.1%
 
2016-06-01 16:34:27 6 < 0.1%
 
2018-11-15 13:25:21 5 < 0.1%
 
2017-05-29 12:22:45 5 < 0.1%
 
2015-12-14 21:37:01 5 < 0.1%
 
2017-04-10 16:00:12 5 < 0.1%
 
Other values (409534) 423848 > 99.9%
 

Composition

Contains charsFalse
Contains digitsTrue
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length19
Mean length19
Min length19
Scatter

title
Categorical

HIGH CARDINALITY
Distinct count141
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size3.2 MiB
Traffic: VEHICLE ACCIDENT -
98401
Traffic: DISABLED VEHICLE -
 
31871
Fire: FIRE ALARM
 
24380
EMS: FALL VICTIM
 
21253
EMS: RESPIRATORY EMERGENCY
 
21158
Other values (136)
226846
ValueCountFrequency (%) 
Traffic: VEHICLE ACCIDENT - 98401 23.2%
 
Traffic: DISABLED VEHICLE - 31871 7.5%
 
Fire: FIRE ALARM 24380 5.8%
 
EMS: FALL VICTIM 21253 5.0%
 
EMS: RESPIRATORY EMERGENCY 21158 5.0%
 
EMS: CARDIAC EMERGENCY 20616 4.9%
 
EMS: VEHICLE ACCIDENT 16928 4.0%
 
Traffic: ROAD OBSTRUCTION - 14134 3.3%
 
EMS: SUBJECT IN PAIN 12001 2.8%
 
EMS: HEAD INJURY 11102 2.6%
 
Other values (131) 152065 35.9%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length36
Mean length22.72649083
Min length10
Scatter

twp
Categorical

HIGH CARDINALITY
Distinct count69
Unique (%)< 0.1%
Missing159
Missing (%)< 0.1%
Memory size3.2 MiB
LOWER MERION
 
36441
ABINGTON
 
25835
NORRISTOWN
 
23883
UPPER MERION
 
22694
CHELTENHAM
 
19629
Other values (63)
295268
ValueCountFrequency (%) 
LOWER MERION 36441 8.6%
 
ABINGTON 25835 6.1%
 
NORRISTOWN 23883 5.6%
 
UPPER MERION 22694 5.4%
 
CHELTENHAM 19629 4.6%
 
POTTSTOWN 17500 4.1%
 
UPPER MORELAND 14707 3.5%
 
LOWER PROVIDENCE 14025 3.3%
 
PLYMOUTH 12800 3.0%
 
UPPER DUBLIN 11910 2.8%
 
Other values (58) 224326 52.9%
 

Composition

Contains charsTrue
Contains digitsFalse
Contains whitespaceTrue
Contains non-wordsTrue

Length

Max length17
Mean length11.07682781
Min length3
Scatter

zip
Real number (ℝ≥0)

MISSING
Distinct count171
Unique (%)< 0.1%
Missing52129
Missing (%)12.3%
Infinite0
Infinite (%)0.0%
Mean19234.73227
Minimum3366
Maximum77316
Zeros0
Zeros (%)0.0%
Memory size3.2 MiB
Mini histogram

Quantile statistics

Minimum3366
5-th percentile18969
Q119038
median19401
Q319446
95-th percentile19468
Maximum77316
Range73950
Interquartile range (IQR)408

Descriptive statistics

Standard deviation301.3888975
Coefficient of variation (CV)0.01566899364
Kurtosis3799.396331
Mean19234.73227
Median Absolute Deviation (MAD)228.2078512
Skewness16.73511008
Sum7151088763
Variance90835.26755
Histogram
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
19401 28656 6.8%
 
19464 27948 6.6%
 
19403 21631 5.1%
 
19446 20496 4.8%
 
19406 14097 3.3%
 
19002 13380 3.2%
 
19468 12393 2.9%
 
19046 11720 2.8%
 
19454 11457 2.7%
 
19090 11149 2.6%
 
Other values (160) 198853 46.9%
 
(Missing) 52129 12.3%
 
ValueCountFrequency (%) 
3366 1 < 0.1%
 
7203 1 < 0.1%
 
8033 1 < 0.1%
 
8065 1 < 0.1%
 
8361 3 < 0.1%
 
ValueCountFrequency (%) 
77316 1 < 0.1%
 
36107 1 < 0.1%
 
23005 1 < 0.1%
 
21701 1 < 0.1%
 
19607 1 < 0.1%
 

Correlations

Missing values

Sample

First rows

addrdescelatlngtimeStamptitletwpzip
0REINDEER CT & DEAD ENDREINDEER CT & DEAD END; NEW HANOVER; Station 332; 2015-12-10 @ 17:10:52;140.297876-75.5812942015-12-10 17:10:52EMS: BACK PAINS/INJURYNEW HANOVER19525.0
1BRIAR PATH & WHITEMARSH LNBRIAR PATH & WHITEMARSH LN; HATFIELD TOWNSHIP; Station 345; 2015-12-10 @ 17:29:21;140.258061-75.2646802015-12-10 17:29:21EMS: DIABETIC EMERGENCYHATFIELD TOWNSHIP19446.0
2HAWS AVEHAWS AVE; NORRISTOWN; 2015-12-10 @ 14:39:21-Station:STA27;140.121182-75.3519752015-12-10 14:39:21Fire: GAS-ODOR/LEAKNORRISTOWN19401.0
3AIRY ST & SWEDE STAIRY ST & SWEDE ST; NORRISTOWN; Station 308A; 2015-12-10 @ 16:47:36;140.116153-75.3435132015-12-10 16:47:36EMS: CARDIAC EMERGENCYNORRISTOWN19401.0
4CHERRYWOOD CT & DEAD ENDCHERRYWOOD CT & DEAD END; LOWER POTTSGROVE; Station 329; 2015-12-10 @ 16:56:52;140.251492-75.6033502015-12-10 16:56:52EMS: DIZZINESSLOWER POTTSGROVENaN
5CANNON AVE & W 9TH STCANNON AVE & W 9TH ST; LANSDALE; Station 345; 2015-12-10 @ 15:39:04;140.253473-75.2832452015-12-10 15:39:04EMS: HEAD INJURYLANSDALE19446.0
6LAUREL AVE & OAKDALE AVELAUREL AVE & OAKDALE AVE; HORSHAM; Station 352; 2015-12-10 @ 16:46:48;140.182111-75.1277952015-12-10 16:46:48EMS: NAUSEA/VOMITINGHORSHAM19044.0
7COLLEGEVILLE RD & LYWISKI RDCOLLEGEVILLE RD & LYWISKI RD; SKIPPACK; Station 336; 2015-12-10 @ 16:17:05;140.217286-75.4051822015-12-10 16:17:05EMS: RESPIRATORY EMERGENCYSKIPPACK19426.0
8MAIN ST & OLD SUMNEYTOWN PIKEMAIN ST & OLD SUMNEYTOWN PIKE; LOWER SALFORD; Station 344; 2015-12-10 @ 16:51:42;140.289027-75.3995902015-12-10 16:51:42EMS: SYNCOPAL EPISODELOWER SALFORD19438.0
9BLUEROUTE & RAMP I476 NB TO CHEMICAL RDBLUEROUTE & RAMP I476 NB TO CHEMICAL RD; PLYMOUTH; 2015-12-10 @ 17:35:41;140.102398-75.2914582015-12-10 17:35:41Traffic: VEHICLE ACCIDENT -PLYMOUTH19462.0

Last rows

addrdescelatlngtimeStamptitletwpzip
423899W MORELAND RD & KIMBALL AVEW MORELAND RD & KIMBALL AVE; UPPER DUBLIN; 2018-11-16 @ 08:38:23;140.149952-75.1345372018-11-16 08:38:23Traffic: VEHICLE ACCIDENT -UPPER DUBLIN19090.0
423900HOFFMANHOFFMAN ; AMBLER; 2018-11-16 @ 08:46:25;140.157913-75.2036322018-11-16 08:46:25Traffic: DISABLED VEHICLE -AMBLER19002.0
423901SHANNONDELL DR & SHANNONDELL BLVDSHANNONDELL DR & SHANNONDELL BLVD; LOWER PROVIDENCE; Station 322A; 2018-11-16 @ 08:53:03;140.133037-75.4084632018-11-16 08:53:03EMS: MEDICAL ALERT ALARMLOWER PROVIDENCE19403.0
423902RICHARDSON RD & COUNTY LINE RDRICHARDSON RD & COUNTY LINE RD; MONTGOMERY; 2018-11-16 @ 08:55:36;140.271663-75.2384402018-11-16 08:55:36Traffic: DISABLED VEHICLE -MONTGOMERY18914.0
423903HORSHAM RD & STUMP RDHORSHAM RD & STUMP RD; MONTGOMERY; 2018-11-16 @ 08:54:52;140.235373-75.2247512018-11-16 08:54:52Traffic: DISABLED VEHICLE -MONTGOMERY19454.0
423904BUCK RD & WOODWARD DRBUCK RD & WOODWARD DR; LOWER MORELAND; 2018-11-16 @ 08:54:08;140.139993-75.0498642018-11-16 08:54:08Traffic: VEHICLE ACCIDENT -LOWER MORELAND19006.0
423905OAK DR & MOYER RDOAK DR & MOYER RD; LOWER SALFORD; 2018-11-16 @ 08:53:32;140.270121-75.3828252018-11-16 08:53:32Traffic: VEHICLE ACCIDENT -LOWER SALFORD19438.0
423906OAK DR & MOYER RDOAK DR & MOYER RD; LOWER SALFORD; 2018-11-16 @ 08:54:19;140.270121-75.3828252018-11-16 08:54:19Traffic: VEHICLE ACCIDENT -LOWER SALFORD19438.0
423907SUMNEYTOWN PIKE & WELLINGTON DRSUMNEYTOWN PIKE & WELLINGTON DR; LOWER GWYNEDD; 2018-11-16 @ 08:51:48;140.190946-75.2372852018-11-16 08:51:48Traffic: VEHICLE ACCIDENT -LOWER GWYNEDD19002.0
423908HOFFMANHOFFMAN ; LOWER GWYNEDD; 2018-11-16 @ 08:46:25;140.155164-75.2646652018-11-16 08:46:25Traffic: DISABLED VEHICLE -LOWER GWYNEDD19422.0